SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: \Schering Corporation 

(ii) TITLE OF INVENTION: Thioredoxin/ Heterologous Protein 

Bacterial Expression System 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schering-Plough Corporation 

(B) STREET: 2000 Galloping Hill Road 

(C) CITYAKenUworth 

(D) STATe\ New Jersey 

(E) COUNTRY: U.S.A. 

(F) ZIP : 07033-0530 

(v) COMPUTER READABUE FORM: 

(A) MEDIUM TYPE: diskette 

(B) COMPUTERAApple Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.5.3 

(D) SOFTWARE: Microsoft Word 5.1a 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 60/011,606 

(B) FILING DATE: 30-APR-1996 

(viii) ATTORNEY/AGENT INFORMATM 

(A) NAME: Thampoe, Imma^J.. 

(B) REGISTRATION NUMBER: 36,322 

(C) REFERENCE DOCKET NUiyiBER: JB0600Q 

(2) INFORMATION FOR SEQ ID NO:l: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCEtoESCRIPTION: SEQ ID NO:l: 
CCTGTGGAGT TACATATGAG CGATAAAATT 3 0 

(2) INFORMATION FOR SEQ ID NO:2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 bases 

(B) TYFt: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: W 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GCACCCAACA TGCAAGGATC CTTACGCCAG ATTAGCATCG AGGAACT 47 

(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33&base pairs 

(B) TYPE: nucleic aVid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lin\ar 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SE£ ID NO:3: 

ATGAGCGATA AAATTATTCA CCTGACTGAt GACAGTTTTG ACACGGATGT 5 0 

ACTCAAAGCG GACGGGGCGA TCCTCGTCGA\ TTTCTGGGCA GAGTGGTGCG 100 

GTCCGTGCAA AATGATCGCC CCGATTCTGG \ATGAAATCGC TGACGAATAT 150 

CAGGGCAAAC TGACCGTTGC AAAAC TGAAC ATCGATCAAA ACCCTGGCAC 2 00 

TGCGCCGAAA TATGGCATCC GTGGTATCCC GACTCTGCTG CTGTTCAAAA 250 

ACGGTGAAGT GGCGGCAACC AAAGTGGGTG CACTGTCTAA AGGTCAGTTG 3 00 

AAAGAGTTCC TCGATGCTAA TCTGGCGTAA GGATCC 336 


(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 


ATGAGCGATA 
ACTCAAAGCG 
GTCCGTGCAA 
CAGGGCAAAC 
TGCGCCGAAA 
ACGGTGAAGT 
AAAGAGTTCC 


AAATTATTCA 
GACGGGGCGA 
AATGATCGC 
TGACCGTTGC 
TATGGCATCC ' 
GGCGGCAACC 
TCGAGGCTAA 


CCTGACTGAC 
TCCTCGTCGA 
CCGATTCTGG 
AAAACTGAAC 
GTGGTATCCC 
iGTGGGTG 
TfcTGGCGTAA 


GACAGTTTTG 
TTTC TGGGC A 
ATGAAATCGC 
ATCGATCAAA 
GACTCTGCTG 
CACTGTCTAA 
GGATCC 


ACACGGATGT 50 
GAGTGGTGCG 100 
TGACGAATAT 150 
ACCCTGGCAC 200 
CTGTTCAAAA 250 
AGGTCAGTTG 3 00 
336 


(2) INFORMATION FOR SEQ ITS> NO:5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 b\ses pairs 

(B) TYPE: nucleic aci 

(C) STRANDEDNES& single 

(D) TOPOLOGY: linea\ 
(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ to NO:5: 


GAT AAT ATT CTG GCT GGT TCT GGT 
Asp Asn Asn Leu Ala Gly Ser Gly 
1 5 


GGT GAT GAC GAT GAC AAG 45 
sr Gly Asp Asp Asp Asp Lys 
10 15 


GGT CCT GTT CCG CCG TCT ACC GCT CTG CGT GAG CTC 
Gly Pro Val Pro Pro Ser Thr Ala Leu\Arg Glu Leu 
20 \ 25 


81 


(2) INFORMATION FOR SEQ ID NO:6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 


Asp Asn Asn Leu Ala Gly Ser Gly Ser Gly Asp yVsp Asp Asp Lys 
1 5 10 \ 15 

Gly Pro Val Pro Pro Ser Thr Ala Leu Arg Glu L^u 
20 25 
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(2) INFORMATION FOR SEQ ID NO:7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH\52 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAAGGAGGCT GATTAAATGG GTCCGaTTCC GCCGTCTACC GCTCTGGAGC 50 
TC \ 52 

(2) INFORMATION FOR SEQ ID NO:8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single\ 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO\8: 

AAGGAGGCTG ATTAAATG 

(2) INFORMATION FOR SEQ ID NO:9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

AAGGAGGCTG ATTAATG 


- 12 - 

(2) INFORMATION FOR'SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: niicleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA^ 
(xi) SEQUENCE DESCRIPTION SEQ ID NO: 10: 

AAGGAGGTTT AATG 14 

(2) INFORMATION FOR SEQ ID Nfo:ll: 
(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 6 amino £\cids 

(B) TYPE: amino acid 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION SEQ iaNO:ll: 

Leu Asp Ala Asn Leu Ala 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION SEQ ID NO:12: 


CTC GAT GCT AAT CTG GCG TAA 21 
Leu Asp Ala Asn Leu Ala 6 


(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION SEQ ID NO:13: 


CTC GAG GCT AAT 
Leu Glu Ala Asn 


GCG TAA 21 
Ala 6 


(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino aci<J 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION SE(AlD NO: 14: 


Leu Glu Ala Asn Leu Ala 


